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Abstract — A multi-relational graph maintains two or more 
relations over a vertex set. This article defines an algebra for 
traversing such graphs that is based on an n-ary relational alge- 
bra, a concatenative single-relational path algebra, and a tensor- 
based multi-relational algebra. The presented algebra provides a 
monoid, automata, and formal language theoretic foundation for 
the construction of a multi-relational graph traversal engine. 

I. Introduction 

The adjacency of vertex i and vertex j is defined by the 
edge A structure of this form is called a graph and is 

usually defined as G = (V , E), where i,j£V are vertices and 
(i, j) € E is the edge adjoining those vertices^] When the only 
distinguishing characteristic between two edges is the vertices 
they join, the graph is called single-relational. The reason for 
this is that there is only a single type of relation in the graph — 
namely, the binary relation E C (V X V). Single-relational 
graphs have been used widely to model various systems of 
homogenous elements related by a single type of relation 
and as such, have numerous algorithms associated with their 
analysis [1]. 

When the domain of discourse is variegated by a heteroge- 
neous set of relations, then the multi-relational graph becomes 
the more applicable construct. A multi-relational graph can be 
defined as G = (V",E), where E is a family of edge sets 
and E = {Ei,E 2 , . . . , E m C (V X V)}. When m > 1, then 
there are multiple relations between the vertices of V . Multi- 
relational graphs not only specify which vertices are adjacent 
to one another, they also specify the way in which they are 
adjacent. With respect to the formalisms of this article and 
without loss of generality, a multi-relational graph can also be 
represented as G — (V,E), where E is ternary relation, E C 
(V x VL x V), and £1 is a set of edge labels (i.e. relation types). 
Thus, in reference to the structure G = (V", E), |E| = \Q\ and 



E 



n<|E| 



E n 



\E\ : E n £ E. The ternary relation model 



is the multi-relational graph structure used throughput this 
article. The reason for the use of this particular G definition 
will be explained in SjTTJ 

Given the growing use of multi-relational graphs in comput- 
ing [2] and the lack of graph techniques for such structures 
(relative to single-relational graphs), an algebraic model for 

'The "high dot" notation denotes that G ^ G ^ G, where G is the main 
definition used throughout the article. 



traversing multi-relational graphs is presented. This article 
can be interpreted as a convergence of the n-ary relational 
algebra of [3], the concatenative single -relational path algebra 
in [4], and the multi -relational tensor algebra presented in [5]. 
However, unlike [3], the presented algebra is tied specifically 
to path construction by means of graph traversals as in [5] 
and [4]. Next, unlike the algebra in [4], which is oriented 
primarily towards single-relational graphs, the presented alge- 
bra conveniently supports multiple relations as in [3] and [5]. 
Finally, unlike [5], the presented algebra is a concatenative, 
order-preserving variation of the relational algebra in [3] and, 
as such, more aligned with [4]. 

The operations presented are summarized in the itemization 
below and are provided here as a consolidated summary for 
ease of reference. 

• || a || : the path length of path a. 

• o : E* x E* ->• E* : the concatenation of two pathsQ 



x N+ 



E: the projection of the n th edge of a 
V: the projection of the tail (first element) 
V: the projection of the head (last element) 



a : E 
path. 

7- : E* 
of a path 
7+ : E* 
of a path. 

lj : E —¥ fi: the projection of the label of an edge 

U : V(E*) x V(E*) 
sets. 

cx : V(E*) x V{E*) 
of two path sets. 



V{E*): the union of two path 
V{E*): the concatenative join 
— > "P(i5*): the concatenative 



. x : V{E*) x V{E*) 
product of two path sets. 

Definitions of these operations are provided in S^TTJ The 
use of these operations to represent basic traversal idioms is 



presented in \ III In S IV regular paths can be recognized and 



generated as demonstrated in ^IV-A and SIV-B respectively. 
Making use of the algebra to evaluate single-relational graph 



algorithms is presented in ^IV-C The algebra provides a set 
of core operations for constructing a multi-relational graph 
traversal engine that is founded on monoid, automata, and 
formal language theory. 



2 The unary Kleene star operation * forms the free monoid E* — U^=o J 
where = {e} and e is the empty /identity element. 



II. Core Operations 



then 



Traversing a graph is the process of moving over the edges 
specified in E. During a traversal, paths are derived and 
properties of those paths can be extracted. 

Definition 1 (Path): A path a in a multi-relational graph is 
a sequence, or string, where a € E* and E C (V x £1 x V). 
A path allows for repeated edges. The path length is denoted 
|| a | and is equal to the number of edges in a. Any edge in E 
is a path with a path length of 1 as e e E C E* . 

The binary operation o : E* x E* — > E* is the concatenation 
of two paths into a new path such that if (i, a, j) and (j, /3, k) 
are two edges in E, then their concatenation is the path 
(i, a, j, j, [3, k), where i, j, k g V and a, (3 E ft. Concatenation 
is associative (i.e. (a o b) o c = a o (6 o c)), not commutative 
(i.e. it is generally true that a o b ^ b o a), and e serves as an 
identity (i.e. eoa = a = aoe). 

Operations exist to extract information out of a path. The 
operation a : E* x N + — s- E is a projection that maps a path to 
the n th edge in that path. For example, if a = (i, a,j,j, /?, k), 
then <r(a, 1) = (i, a, j) and <r(a, 2) = (j, /3, fc). Next, for any 
path, 7~ : E* — > V projects the tail (first vertex) of the path 
such that 7~((i,a,j)) = i. Likewise, 7+ : E* — > V, where 
7 + ((i,a,j)) = j. Similarly, for edge labels, cj ; E — > Q, 
where uj((i,a, j)) — 

Definition 2 (Path Label): The path label of path a is de- 
fined as the edge labels contained in a. Formally, if a is a path, 
then the path label is constructed by uj' : E* — > fi*, where, 
using concatenation, 

n< I a I 

u/(a) = oj(a(a,n)). 

n=l 

The path label of any single edge e £ E is simply the edge's 
label as ||e|| = 1 and o/(e) = u>(a(e, 1)) = w(e). 

The binary operation U : V(E*) x V(E*) ->■ V[E*) 
is standard set union. The binary operation txi : ^(E*) x 
^(E*) — > ^(E*) is the concatenative join of two sets of 
paths such that if A, B e V(E*), then 

AtX B = {a o 6 I a e A A b e B 

A (a = eV6 = eV 7 + (a) = 7^(6))}, 

where 7 + (a) = 7" (6) ensures that only joint (i.e. adjacent) 
paths are concatenated]^] For example, if 

A = {(i,a,j), (j,f3,k,k,a,j)} 

and 

B = {(j,/3,j),(j,f3,i,i,a,k),(i,(3,k)}, 

3 A11 projection operations can be reduced to a single string indexing 
operation, but for the sake of clarity in the following discussion, they are 
presented as being atomic. 

4 The defined concatenative join is analogous to the 0-join in [3], where 

1,1 ■ In this form, its known as an equijoin. A discussion 

7 +(a) =7 (6) 

relating concatenative join and the relational algebra is found in [6]. 



A m„ B = {(i,a,j,j,f3,j), (i,a,j,j,0,i,i,a,k), 
(j,ft,k,k,a,j,j,(3,j), 
(j,ft,k,k,a,j,j,(3,i,i,a,k)}, 

where i,j,k € V, a,P € fi, and (i,a,j), (j,/3,k), (k,a,j), 
{j,P,3),i.3,P,i),{i,ot,k),{i,P,k) e E. Given that dx is 
based on o, txi is associative, but not commutative. 

Definition 3 (Path Jointness): A path is joint is it satisfies 
the characteristic function / : E* — > {T, _L} with the function 
rule 

(T if||a|| = l, 
f(a) = <T if Knli alhl J + (v(a,n))= 1 -(a(a,n + l)), 
I _L otherwise. 

The function maps to T if the path is joint and _L if it is 
disjoint. 

The binary operation M c constructs joint paths. It may be the 
case that traversing disjoint paths is desirable]^] The Carte- 
sian product supports the concatenation of potentially disjoint 
paths. As such, x D : V(E*) x V(E*) V(E*), where 
Ax B = {aob\aeA A be B}. 

Finally, to conclude this section, the reason why the G = 
(V,t = {E u E 2 ,...,E m C (V x V)}) definition of a 
multi-relational graph is not used is because when evaluating 
concatenative joins over binary relations, the edge label infor- 
mation is lost and thus, the path label can not be determined. 
In other words, if e and / are edges from two different binary 
relations, then e o / would only provide a sequence of vertices 
and as such would not specify from which relations the join 
was constructed. This is a deficiency of the algebra in [4], 
where binary relations are used and o : V* x V* — > V* as 
opposed to o : E* x E* ->• E* , where E = (VxOxl^). While 
the algebra in [4] is applicable to multi-relational graphs (as 
any two relations can be joined), it was specifically intended 
for single-relational graphs, where problems involving path 
labels are not considered. In contrast, the specification defined 
in this article preserves path labels. 

III. Basic Traversals 

From the explicit adjacencies (edges) defined in the edge set 
E, there exists implicit adjacencies (paths) defined by e o /, 
where e, / € E and e o / e E* . Given the previously defined 
operations, different types of common traversal idioms can be 
affected. 

A. Complete Traversal 

All joint paths through a graph of length n can be con- 
structed using E cxi . . . txi E. This type of traversal is called 

v v ' 

n times 

a complete traversal because there is no discrimination when 
joining except that the join vertex (i.e. the head of the first 
path and tail of the second) be equal. When it is desirable 

5 For example, priors-based algorithms require the concept of "teleportation" 
in order to make a disjoint jump in the graph. 



to limit the set of paths derived by the traversal then the sets 

A. B C E need to be defined and joined. 

B. Source Traversal 

A source traversal emanates from a particular set of vertices. 
Such a traversal is left restricting as it constructs paths whose 
tail vertex is an element of V s C V . The first concatenative 
join must, on its left side, contain the set of all edges in E 
that have their tail vertex in V s . Therefore, when 

A = {e\eeE A 7~(e) £ V s }, 

A c<i E . . . M E yields all joint paths of length n emanating 

n times 

from the vertices in V s . When V s = V, a complete traversal 
is evaluated since A = E. For ease of expression, the 
complement of the set V s can be used to denote where not 
to start a traversal from. For example, V s — V \ V S states to 
start the traversal from all vertices in V except those in V s . 

C. Destination Traversal 

A destination traversal is similar to a source traversal, except 
that it is right restricting as it constructs all paths of length 
n whose head, or terminal, vertex is in Vd C V. In this way, 
when 

B = {e\eeE A 7 +(e) G V d }, 
E N . . . E M B is a destination traversal. When Vd = V, 

V v ' 

n times 

a complete traversal is evaluated because B = E in such 
situations. 

By combining a source and destination traversal, its possible 
to emanate from particular vertices and arrive at particular 
vertices, where A xi E . . . E txi B is the set of all joint paths 

s V ' 

n times 

that start from vertices in V s , end at vertices in V d , and are of 
length n. Source and destination traversals can also be used 
to ensure that each edge in the path goes through a particular 
set of vertices by specifying, at some particular X Q step, the 
source (or destination) vertex set as V s (or Vd) before enacting 
the next concatenative join. 

D. Labeled Traversal 

A traversal can be constrained to particular path labels by 
defining an edge set that is a function of its edge labels. For 
example, if fi e C ft, £lf C fi, 

A = {e | e G E A w(e) G ft e }, 

and 

B = {f\f€E A w(/)en,}, 

then A tx Q B denotes all paths where oj(a(a, 1)) G O e and 
cj(a(a, 2)) G 0/. When f2 e = flf = il, a complete traversal 
is enacted as, in such situations, A = B = E. The labeled 
traversal is possible because the relation type is represented 
in the edge definition E C (V x x V) and there exists the 
label projection function lu : E — > ft. 



IV. Derivative Traversals 

The basic traversals defined in j jlTT| can be mixed and 
matched to yield different types of joint paths in E* . This sec- 
tion will introduce some typical applications of the presented 
multi-relational path algebra to problems that are specific 
to multi-relational graphs — focusing primarily on problems 
involving regular paths |j 

A. Regular Path Recognizer 

The presented multi-relational path algebra has application 
to regular expressions and their corresponding finite state au- 
tomata. Before presenting this application, an example-specific 
set-builder notation is introduced in order to specify subsets 
of £ in a more concise, readable manner than previously 
presented. A source edge set can be specified as [£, _, _] = 

UaenUjevfo «>.?') : (*> a >i) e E in order t0 denote the 
set of all edges that emanate from vertex i. A destination 
edge set can be specified as [_, = |J ie y U ae o(^ a >i) : 
(i,a,j) £ E in order to denote the set of all edges that 
terminate at vertex j. A labeled edge set can be specified 

as [_,a, _] = U<evUjev-(*' a 'i) : e E in order t0 

denote the set of all edges that have a as their label. Finally 

[-,-,-] = E. 

If E is the regular expression alphabet, then 0, e, and any 
e G E are regular expressions. If R and Q are regular expres- 
sions, then R U Q, R cxi Q, and R* are regular expressions 
[7]Qa regular expression over E, and corresponding finite 
state automaton, recognize a set of joint paths in "P(_E*)[^For 
example, 

[£,a,_]M l,8,_]*(([_,a,j]t< {{j,a,i)}) U [_,a,fc]) 

recognizes all paths emanating from i, terminating at i or k, 
with the first and last label traversed being a, and all interme- 
diate edge labels (zero or more) being 0, The corresponding 
finite state automaton is diagrammed in Figure [T[ where the 
transition function is based on set membership, not equality]^] 
Regular paths in graphs are explored in depth in [8], where 
only paths with particular path labels are considered for 
recognition. In other words, in [8], a regular expression is 
defined for the alphabet Q, where above, its defined for E. 

B. Regular Path Generator 

By making use of a non-deterministic single-stack au- 
tomaton with a stack alphabet of V(E*), it is possible to 

6 For the sake of simplicity, only regular paths are discussed. However, 
with more machinery (e.g. memory structures), more complex traversals can 
be expressed using the core operations presented in j jTT| 

7 The X o operation can be used to recognize potentially disjoint paths, but 
in practice, when only joint paths are being recognized then Mo is a more 
efficient use of resources as R M Q C R X Q- 

8 The common operations R?, and R n used in practice can be 

represented as R tx3 R*, R U {e}, and R <x} ■ ■ ■ Xlo R, respectively. 



'Given that set membership can be represented element- wise as element 
equality under or, each element of the transition label edge set can be 
individually denoted as a transition with the same tail and head state. As 
such, the typical finite state automaton transition exists. For diagram clarity, 
set membership is used instead of equality. 




Fig. 1. A finite state automaton to recognize and generate a set of paths 
in V(E*). The left most state is the start state and the double-circle states 
denote accepting states. 



generate all paths in G that can be recognized by some regular 
expression. The non-deterministic aspect of the automaton 
ensures that all branches in the state machine are taken "in 
parallel." The single-stack aspect refers to the fact that the 
automaton (and thus, its cloned/branched automata) maintain 
a first-in/last-out stack memory that can be pushed and popped. 

Initially, the automaton's stack contains the element {e}. 
The automaton will halt whenever its stack element is or is 
in an accepting state. For each state transition (which happens 
unless the automaton has been halted), the path set defined 
on the transition label is joined on the right with the path set 
popped off the stack. The result of the join is then pushed back 
onto the stack. Whenever a branch in the automaton's state 
graph is approached, all branches are taken "in parallel." Thus, 
given the automaton diagrammed in Figure [T] the following 
joins are evaluated. 

{e} M„ [i, a, _] M D [_, a,j] M D {{j, a, i)} 

{e} M [i, a, _] txi [_, a, k] 

{e} M„ [i,a,_] M D [_,£,-]••• Mo [-,a,j] M D {{j,a,i)} 

{e} Mo [i, a, _] M [_, P, _] . . . M [_, a, k] 

The union of the first (and only) element of all the stacks 
across all branches of accept-state automaton forms the set of 
all paths in G that satisfy the regular expression. 

C. Constructing Semantically-Rich Single-Relational Graphs 

Most of the graph algorithms in existence today have 
been developed for single-relational graphs. Examples of such 
algorithms include the geodesies (e.g. closeness centrality, 
betweenness centrality), spectral (e.g. eigenvector centrality, 
spreading activation), and assortative (e.g. scalar and discrete) 
algorithms (see [1] for a consolidate review and analysis 
of many such algorithms). When applied to multi-relational 
graphs, these algorithms have the potential drawback of losing 
their meaning and thus, their applicability. To explicate this 
statement, it is important to consider the way in which a 
single-relational graph algorithm can be formally applied to 
multi-relational graphs. One method that can be employed 
is to simply ignore edge labels and, potentially, repeated 
edges between the same two vertices. However, when there 
are numerous ways in which one vertex can be related to 
another vertex, what is the resulting semantics of, say, a 



centrality algorithm? Another method is to extract a single 
edge relation, based on its label, from the multi-relational 
graph. For example, its possible to construct the binary edge 
set 

E a = {(7"(e),7 + (e)) I e e E A w(e) - a} 

and utilize that subgraph as the source of a single-relational 
graph algorithm. However, with multiple ways in which 
vertices can be related, more abstract relationships can be 
inferred through paths. Thus, in the final method, single- 
relational graphs can be generated from the multi-relational 
graph through the derivation of implicit edges defined through 
paths. Using a simple example, if a, (3 s 51 are two edge 
labels, then all a/3-paths can be constructed when A = 
{e | e € E A w(e) = a}, B = {e \ e G E A w(e) = /3} and 
A M B. The tail and head vertices of these paths can then 
be projected to form a new binary edge set 

E <*f>= (J (7~(a),7 + (a))- 

Thus, E a fj C (V X V) can be subjected to all known 
single-relational graph algorithms. For regular paths, a regular 

Mapping single- 



path generator can be used as in [IV-B 



relational graph algorithms over to the multi-relational domain 
is explored in depth in [5], 

V. Conclusion 

This article defined a path algebra for multi-relational 
graphs represented as G = (V,EC (V x Q x V). The 
core traversal types (complete, source, destination, and la- 
beled) allow for the expression of more expressive traversals 
through the restriction of the join set E. Applications to 
regular path recognizers ({ IV-A) , generators (SIV-Bi, and 
"semantically-rich" single-relational graph construction IV- 
[C) were presented. Generally, the algebra has applicability to 
the construction of a multi-relational graph traversal engine. 
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